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Abstract — In this paper we develop scheduling policies that 
maximize the stability region of a wireless network under the as- 
sumption that mutual information accumulation is implemented 
at the physical layer. When the link quality between nodes 
is not sufficiently high that a packet can be decoded within 
a single slot, the system can accumulate information across 
multiple slots, eventually decoding the packet. The result is an 
expanded stability region. The accumulation process over weak 
links is temporally coupled and therefore does not satisfy the 
independent and identically distributed (i.i.d) assumption that 
underlies many previous analysis in this area. Therefore the 
problem setting also poses new analytic challenges. We propose 
two dynamic scheduling algorithms to cope with the non-i.i.d 
nature of the decoding. The first performs scheduling every T 
slots, and approaches the boundary of the stability region as 
T gets large, but at the cost of increased average delay. The 
second introduces virtual queues for each link and constructs 
a virtual system wherein two virtual nodes are introduced for 
each link. The constructed virtual system is shown to have the 
same stability region as the original system. Through controlling 
the virtual queues in the constructed system, we avoid the non- 
i.i.d analysis difficulty and attain the full stability region. We 
derive performance bounds for both algorithms and compare 
them through simulation results. 

Index Terms — Mutual information accumulation, backpressure 
algorithm, delay analysis, virtual queues. 

I. Introduction 

Because of the broadcast nature of wireless communica- 
tions, multiple receiving nodes may overhear the transmission 
of a transmitting node in a wireless network. Therefore, a 
packet can be routed through different routes to its destination 
node. This "multiple-route diversity" can be exploited to 
improve various measure of network performance including 
throughput, average delay, and the probability of transmission 
failure. 

Different routing and scheduling policies have been pro- 
posed to exploit the "broadcast advantage" of the wireless 
medium. Reference Q proposes ExOR, a packet-level op- 
portunistic routing protocol to exploit high loss-rate radio 
links in wireless networks. Reference |2| proposes a proactive 
link state routing protocol that improves the forward nodes 
selection in |fl~). Reference characterizes the maximum 
network capacity region when packet-level opportunistic rout- 
ing is exploited, and proposes a routing algorithm named 
DIVBAR to stabilize the system. DIVBAR adopts the idea 
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of backpressure algorithms first proposed in 0, 0. Ref- 
erences [Q~)-0 discuss how to select a forwarder based on 
the feedback information sent from the receivers that have 
successfully received the packets. With a similar maximum 
weight scheduling idea, JS] analyzes the optimal information 
theoretic based cooperative scheduling in a diamond network. 
The system is assumed to be able to adaptively select the 
optimal encoding-decoding scheme such that any desirable 
transmission rate vector within the corresponding capacity 
region can be achieved in every slot. This "fluid model" is 
not quite practical. However, because it is simple to analyze, 
it has been widely adopted in the literature. 

The policy space explored in the above papers, in particular 
those having to do with wireless Hi-El, (6]> assumes that 
packet decoding is carried out within a single slot. This 
means that decoding depends only on current control actions 
and channel quality. However, there are many physical layer 
techniques, such as energy accumulation and mutual infor- 
mation accumulation (MIA), that do not fit this assumption. 
Such techniques do not require decoding to occur within 
a single slot but rather allow the receivers to accumulation 
observations across multiple time slots before decoding. This 
allows the network to exploit weak radio links more fully, 
thereby increasing system throughput. 

In this paper we focus on mutual information accumulation 
and, while there is some prior research on resource opti- 
mization in networks using MIA Gl-0, that work does not 
consider questions of network stability and protocol design. 
Our objectives in this work are therefore two fold. First, we 
want to characterize the maximum network capacity region 
when mutual information accumulation is implemented at 
the physical layer. Second, we aim to design joint routing 
and scheduling algorithms that stabilize any point within the 
capacity region. 

To make the concept of mutual information accumulation 
more tangible, it helps to contrast it with energy accumulation. 
In energy accumulation multiple transmissions are combined 
non-coherently by receiving nodes. This is usually enabled 
by using space-time or repetition coding ifTOl - lfTll . Mutual 
information accumulation is more efficient, the difference 
between the two being explained in Q. Consider a pair 
of senders transmitting information over two independent 
additive white Gaussian noise channels to the same receiver. 
Assume transmitter power P and channel noise power N. 
Energy accumulation corresponds to the scenario when each 
transmitter sends the same codeword. When the decoder uses 
maximum ratio combining, a throughput of | log(l + ^S-) bits 
per channel use can be achieved. With mutual information 



2 



accumulation, independent parity symbols are sent, and the 
system can achieve 2 x ^log(l + -£) bits per channel use, 
which outperforms energy accumulation. It has been noted 
in that for Gaussian channels at low signal-to-noise ratio 
(SNR) energy accumulation is equivalent to mutual infor- 
mation accumulation because capacity at low SNR is linear 
in SNR. Mutual information accumulation can be realized 
through the use of rate less (or fountain) codes 13, fH), OH- 

Similar to Cll— O, we assume our system operates at packet- 
level, and each active transmitter transmits a packet in each 
slot. Different from the probabilistic channel model discussed 
in these references, we consider the scenario where the channel 
state varies slowly, so that link conditions can be assumed to 
be static over a long transmission period. Under this scenario, 
for a weak link whose link rate is below the transmission rate, 
a packet cannot be successfully delivered across the link in 
any single slot, even with repetitive attempts in different slots. 
Thus, the corresponding receiver can never decode and become 
a forwarder under the schemes discussed in Hi-El- However, 
when rateless codes are used by the transmitters, although the 
corresponding receiver of the weak link cannot decode the 
packet within a single slot, it can store that corrupted packet 
and accumulate more information in later slots. Eventually, 
a packet can be successfully delivered across a weak link 
after a number of transmissions by accumulating information 
across the slots. Thus, weak links can still be utilized in slowly 
changing network environments. Compared with opportunistic 
routing schemes, mutual information accumulation provides 
more reliable throughput over weak links, and doesn't require 
the feedback information to determine the forwarder in each 
slot. Thus it reduces the required overhead. 

Compared to the networks in J6) and references 03], lfl6l 
where varying rate can be achieved within a slot through 
adaptive encoding-decoding schemes, the mutual information 
accumulation scheme doesn't require the encoding-decoding to 
be changed in every slot. However, on average, it can achieve 
the same rate by repetitive transmission and information 
bits accumulation. Therefore, the system is more practical 
to implement without sacrificing throughput. On the other 
hand, the information bit accumulation process also brings new 
challenges to the design and analysis of routing and scheduling 
algorithm. 

The contribution of our work is three-fold. 

« We characterize the maximum stability region of a wire- 
less network enabled with mutual information accumu- 
lation under certain natural assumptions. Compared with 
networks where information accumulation is not allowed, 
the system is able to exploit weak links and an expanded 
stability region is thereby achieved. 

• We propose two dynamic routing and scheduling policies 
to achieve the maximum stability region. Both policies 
require simple coordination and limited overhead. 

• The techniques we develop to cope with the temporally 
coupled mutual information accumulation process are 
novel and may have use in queuing problems more 
widely. 

The rest of the paper is organized as follows. In Section HU 
we describe the system model. In Section [TTTJ we present the 



maximum stability region of the network under our system 
model. In Section [IV] and Section [V] we design two different 
routing protocols to achieve the maximum stability region and 
analyze the performance. We present simulation evaluation in 
Section IVIll Finally, we conclude in Section [VIIH Proofs are 
deferred to the appendices. 

II. System Model 
A. The Basic System Setting 

We consider a time-slotted system with slots normalized to 
integral units t £ {0, 1,2,.. .}. There are N network nodes, 
and links are labeled according to ordered node pairs 
for i,j £ {1, . . . , N}. We assume that there are K different 
commodities in the network, K < N, where each commodity 
is labeled according to its destination node, e.g., all packets 
from commodity c should be routed to destination node c. Data 
arrives randomly in packetized units. Let A%(t) denote the 
number of packets from commodity c that exogenously arrive 
at network node i during slot t. Arrivals are assumed to be 
independent and identically distributed (i.i.d.) over timeslots, 
and we let = E{A?(t)} represent the arrival rate of 
commodity c into source node i in units of packets/slot. We 
assume A|(i) < A max for all c,i,t. 

We assume the channel fading states between any pair of 
nodes are stationary, and the active transmitters transmit with 
the same power level. Thus, a fixed reliable communication 
rate over an active link can be achieved in each slot. We expect 
that the algorithms developed in this paper can be modified to 
accommodate more general fading processes. 

We use r*jj- to denote the link rate between nodes We 
assume the link rate is reciprocal, i.e., = rji. We assume 
that each node can transmit at most one packet during any 
given node during any single time slot, i.e., ry < 1 packet/slot. 
We term the links with rate one packet per slot to be strong 
links; the rest of the links we term weak links. For weak links, 
we assume their rates are lower bounded by some constant 
value r m i n , < r min < 1. Define the set of neighbors of 
node i, J\f(i), as the set of nodes with r,-j > 0, j £ Af(i). We 
assume the size of Af(i), denoted as |A/"(i)|, is upper bounded 
by a positive integer d. We define fi max as the maximum 
number of packets a node can successfully decode in any slot, 
which is equivalent to the maximum number of nodes that can 
transmit to a single node simultaneously. Therefore, we have 

We assume the system operates under constraints designed 
to reduce interference among transmitters. Under the interfer- 
ence model, in any timeslot, only a subset of nodes are allowed 
to transmit simultaneously. We assume that transmissions 
from nodes active at the same time are interference free. We 
denote the set of feasible activation pattern as S, where an 
activation pattern s £ S represents a set of active nodes. With 
a little abuse of the notation, we interchangeably use s to 
represent an activation pattern and the set of active nodes of 
the pattern. For any s £ S, we assume all of its subsets also 
belong to s. This allows us to assert that all of the nodes 
in an activation pattern always transmit when that activation 
pattern is selected. This interference model can accommodate 
networks with orthogonal channels, restricted TDMA, etc. 
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B. Mutual Information Accumulation at Physical Layer 

We assume mutual information accumulation is adopted at 
physical layer. Specifically, we assume that if a weak link with 
rij < 1 is activated, rateless codes are deployed at its corre- 
sponding transmitter. When a packet transmitted over a weak 
link cannot be successfully decoded during one slot, instead 
of discarding the observations of the undecodable message, 
the receiver stores that partial information, and accumulates 
more information when the same packet is retransmitted. A 
packet can be successfully decoded when the accumulated 
mutual information exceeds the packet size. The assumption 
fij > r m i n implies that for any active link, it takes at most 
some finite number of time slots, [~l/r TO j„], to decode a 
packet. 

In order to simplify the analysis, we assume that 1/Vjj is 
an integer. The reason for this choice will become clear when 
our algorithm is proposed in Section [HI] If does not satisfy 
this assumption, we round it down to l/fl/r^] to enforce it. 

The challenge in extending the backpressure routing frame- 
work to systems that use mutual information accumulation is 
illustrated through the following example. Suppose that a node 
i has already accumulated half of the bits in packet 1 and half 
of the bits in packet 2. Since neither of the packets can be 
decoded, none of these bits can be transferred to the next hop, 
even though the total number of bits at node i is equal to 
that of a full packet. This means that we need to handle the 
undecoded bits in a different manner. We also observe that, 
if node i never accumulates enough information for packets 
1 or 2, then these packets can never be decoded and will be 
stuck at node i. If we assume that the system can smartly 
drop the undecoded partial packet whenever a fully decoded 
copy is successfully delivered to its destination, coordination 
among the nodes is required to perform this action. The 
overhead of the coordination might offset the benefits brought 
by mutual information accumulation. Moreover, unlike the 
opportunistic model studied in ID-El, given that weak link is 
active, whether or not a successful transmission will occur 
in that slot is not an i.i.d random variable but rather a 
deterministic function of the number of already accumulated 
bits of that packet at the receiving node. This difference makes 
the analysis even more complicated. 

Therefore, in the following sections, we define two different 
types of queues. One is the traditional full packet queue which 
stores fully decoded packets at each node. The other type of 
queue is a partial packet queue. It represents the fraction of 
accumulated information bits from a particular packet. The 
specific definition of the queues and their evolution depends 
on the scheduling policy, and will be clarified in Section [IV] 
and Section [Vl respectively. 

We assume each node has infinite buffer space to store fully 
decoded packets and partial packets. Overflow therefore does 
not occur. 

C. Reduced Policy Space 

The policy space of the network we consider can be much 
larger than that of networks without mutual information ac- 
cumulation. First, because of the broadcast nature of wireless 



communication, multiple receivers can accumulate information 
regarding the same packet in any given slot, and a receiver can 
collect information on a single packet from multiple nodes. 
Keeping track of the locations of different copies of the same 
packet requires a lot of overhead. Second, allowing multiple 
receivers to store copies of the same packet introduces more 
traffic into the network. Therefore, stabilizing the network 
requires a sophisticated centralized control strategy. Finally, 
accumulating information bits of a packet from multiple nodes 
makes the decoding options of a packet increase exponentially; 
thus characterizing the network maximum stability region 
becomes intractable. Therefore, we make the following as- 
sumptions: 

Al. For each packet in the system, at any given time, only 

one node is allowed to keep a fully decoded copy of it. 
A2. In addition to the node with the fully decoded copy, 
one other node is chosen as the potential forwarder for 
the packet. Only the potential forwarder is allowed to 
accumulate information about the packet. 
Restricting ourselves to this policy space may sacrifice some 
part of the stability region that could be achieved under a 
more general policy space. However, as we will see, these 
assumptions enable us to explicitly characterize the maximum 
stability region with the given policy space. Compared with 
systems that operate without the capability to accumulate 
mutual information, our system is able to exploit weak links 
even when one-slot decoding is not possible. The stability 
region will thereby be greatly enlarged. 

If a node directly contributes to the successful decoding of 
a packet at another node, this node is denoted as a parent for 
that packet. Assumptions A1-A2 guarantee that for any packet 
in the network, there is only one parent at any given time. We 
also note that, if we relax assumptions A1-A2, and make the 
following assumption instead 

A3. Every packet existing in the network has a single parent 
at any given time. In other words, the accumulated 
information required to decode a packet at a node is 
received from a single transmitting node. 
then, the maximum stability region under A1-A2 and that 
under A3 are equivalent. Assumption A3 means that we don't 
allow a node to accumulate information from different nodes 
(different copies of the same packet) to decode that packet. 
However, multiple copies of a packet may still exist in the 
network. 

III. Network Capacity with Mutual Information 
Accumulation 

In this section we characterize the optimal throughput region 
under all possible routing and scheduling algorithms that 
conform to the network structure specified in Section Hi- Al and 
Assumption A3. 

At the beginning of each slot, a certain subset s of nodes is 
selected to transmit in the slot. Any node i £ s can transmit 
any packet that it has received and successfully decoded in 
the past. Note that because packets must be decoded prior 
to transmission, partial packets cannot be delivered. For node 
j £ A/"(«), if it has already decoded a packet being transmitted, 
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it can simply ignore the transmission; otherwise, it listens to 
the transmission and aims to decode it at the end of that slot. 
Receiving nodes connected with strong links can decode the 
packet in one slot. Nodes connected with weak links cannot 
decode the packet in one slot. Rather, they need to listen to the 
same node transmitting the same packet over a number of slots. 
A packet is said to be successfully delivered to its destination 
node when the first copy of the packet is decoded by its 
destination node. These assumptions allow for any possible 
routing and scheduling policy satisfying Assumption A3. We 
note that packets arrive at a node from a single commodity 
may get delivered to their destination node in a permuted order 
since each packet may take a different route. 

Let A represent the input rate vector to the system, where A^ 
is the input rate of commodity c entering node i. Define Y?{t) 
as the number of packets from commodity c that originated 
at node i and have been successfully delivered to destination 
node c over [0,t). According to the definition of network 
stability IPTl . a policy is defined as stable if 

Y c (t) 

lim -s-li = A? Vc. (1) 

f — >oo t 

Stronger definitions of stability can be found in J4), @, 
1181 . The maximum stability region or network layer capacity 
region A of a wireless network with mutual information 
accumulation is defined as the closure of all A that can be 
stabilized by the network according to some policy with the 
structure described above. 

Theorem 1 For a network with given link rates {rij } and a 
feasible activation pattern set S, the network capacity region 
A under assumption A3 consists of all rate vectors {A^} 
for which there exists flow variables {/i^-} together with a 
probability tt s for each possible activation pattern s£5 such 
that 

/4>0, fi c ci = 0, /4 = 0, V»,j,c (2) 
5>« + A i<:5>«' Vz^c,Vc (3) 

E^EE^W^' (4) 

c c sgS 

£t.<1, (5) 

ses 

where the probabilities Oij(s) satisfy 

0f,(s) = Oifiga, (6) 

EEW = 1 > V< - (7) 

c 3 

The necessity of this theorem can be proved following the 
same approach in [3 j and is provided in Appendix [A] The 
sufficiency part is proved in Section [IV] by constructing a 
stabilizing policy for any rate vector A that is in the interior 
of capacity region. 

The capacity region is essentially similar to the capacity 
theorem of 0, ifTTl . The relations in (ff]) represent non- 
negativity and flow efficiency constraints for conservation con- 
straints. Those in (O represent flow conservation constraints. 



Those in (01 represent link constraints for each link 
The variable 0^(s) can be interpreted as the probability that 
the transmission over link eventually contributes to the 
delivery of a packet of commodity c at node c, given that the 
system operates in pattern s. In other words, link is on 
the routing path for this packet from its origin node to its 
destination node. 

This theorem implies that the network stability region under 
Assumption A3 can be defined in terms of an optimization 
over the class of all stationary policies that use only single- 
copy routing. Thus, for any rate vector A G A, there exists a 
stationary algorithm that can support that input rate vector by 
single-copy routing all data to the destination. 

The A defined above are in the same form as when a "fluid 
model" is considered. In other words, the extra decoding con- 
straint imposed by mutual information accumulation does not 
sacrifice any part of the stability region. We can simply ignore 
the packetization effect when we search for the maximum 
stability region. 

The X] c A*ij defining the stability region represents the 
effective flow rate over link An "effective" transmission 

means that the bits transferred by that transmission eventually 
get delivered to the destination. If the transferred information 
becomes a redundant copy, or discarded partial packet, the 
transmission is not effective, and doesn't contribute to the 
effective flow rate. We can always make the inequalities OJ-© 
tight by controlling the value of ir s . 

Solving for the parameters {ir s } and {fty(s)} required to 
satisfy the constraints requires a complete knowledge about the 
set of arrival rates {A^}, which cannot be accurately measured 
or estimated in real networks. On the other hand, even when A 
is given, solving the equations can still be quite difficult. In the 
following, we overcome this difficulty with online algorithms 
which stabilize any A within A, but with a possibly increased 
average delay as A approaches the boundary of A. 

IV. T-slot Dynamic Control Algorithm 

In the following, we construct a policy that fits Assumptions 
A1-A2. Although more restrictive than Assumption A3, we 
will see that the stronger assumptions do not compromise 
stability performance in the sense that they do not reduce the 
stability region. To construct a dynamic policy that stabilizes 
the system anywhere in the interior of A, specified in Theo- 
rem Q] we first define our decision and queues variables. 

We assume that each packet entering the system is labeled 
with a unique index k. At time t, < k < J2 C i S T =i ^i( T )- 
If packet k belongs to commodity c, we denote it as £ T c . 
Let ' (t) | represent the binary control action of the system 

at time t. Specifically, (3^' (t) = 1 means that at time t, node 
i transmits packet k to node j. We restrict the possible actions 
so that in each slot each node transmits at most one packet, 
i.e., 

3,k 

and at most one node is chosen as the forwarder for packet k, 
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i.e., 



1,3 



Because of the mutual information accumulation property, 
even if packet k is transmitted over link in slot t, it 

doesn't necessarily mean that packet k can be decoded at 
node j at the end of slot t. In particular, under the fixed link 
rate assumption, successful transmission cannot occur over 
weak links in a single timeslot. We let (t) be an indicator 

function where /y (t) = 1 indicates that the packet k has 
been successfully delivered from node i to node j in slot t. The 
indicator function is a function of the current control action 
and partial queue status at the beginning of slot t. Apparently, 
4 fe) (t) = 1 implies that pg\t) = 1. 

As discussed in Section ITl-BI we define two types of queues 
at each nodes. One is to store the fully received and suc- 
cessfully decoded packet at the nodes, while the other queue 
stores the partially received packets. We use to denote 

the length of node i's queue of fully received packets from 
commodity c at time t, and use P^ (t) to represent the total 
fraction of packet k accumulated by node i up to time t. The 
sum-length of partial queues of commodity c at node i storing 
partial packets can be represented as Ff(t) = J2k£% M (*)■ 
The fraction of packet k, P, (t), can be cleared either when 
packet k is successfully decoded and enters the full packet 
queue Q\, or when the system controller asks node i to drop 
packet k. With a little abuse of notation, we use Q\ and P[ 
to denote the full packet queue and partial packet queue from 
commodity c at node i, respectively. 

Then, according to our Assumptions A1-A2, the queue 
lengths evolve according to 

Qt(t+l)=(Qt(t)- E (t)) + 

+ E Pu\t)fu\t)+ A i(t) (10) 
z,fceT c 

P W (t + 1) = P W (t) + £ (t) -J2 ffl (t)/£ fc) (t) 



A. The T-slot Algorithm 

(9) Our algorithm works across epochs, each consisting of T 
consecutive timeslots. Action decisions are made at the start 
of each epoch and hold constant through the epoch. We 
analyze the choice of T on the stability region and average 
backlog. Any rate vector A inside the capacity region A can 
be stabilized by a sufficiently large choice of T. 

1) Check single-link backpressure. At the beginning of 
each epoch, i.e., when t = 0, T, 2T, . . ., node i checks its 
neighbors and computes the differential backlog weights 

Wn{t) = max[Q 4 c (t) - Q, c (t)]+r„-, j e jV(i). 

C J 

Denote the maximizing commodity as 

4 =argmax[Q^)-Q J c (t)] + 

2) Select forwarder. Choose the potential forwarder for the 
packets in Qi as the node j with the maximum weight 
Wij(t). Denote this node as j* = argmaxj Wij(t). 

3) Choose activation pattern. Define the activation pattern 
s* as the pattern s 6 S that maximizes 
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Any node i 6 s* with Wij? > transmits packets of 
commodity c*j, to j*. The pairing of transmitter i G s* 
and receiver j* and the commodity being transmitted c* 3 -» 
is continued for T consecutive timeslots. 
4) Clear partial queues. Release all the accumulated bits 
in the partial queue Pf, Vi, c, at the end of each epoch. 

The T-slot algorithm satisfies constraints ([H}-©. The "poten- 
tial forwarder" in Step 2) refers to the forwarder of node i 
if node i is active. We clear all of the partial queues in the 
system every T slots (in Step 4)) for the simplicity of analysis. 
It is likely not the best approach to handle the partial queues. 
Intuitively, the performance should be improved if we only 
release the partial queues when a selected forwarder for a 
packet is not the same as the previous one (thus satisfying 
A3). 



(ii) 



where 



(t) 



n 

l-P, 



P?\t) + r u <l 
k \t) Pf>(i)+r K >l (12) 



Theorem 2 The algorithm stabilizes any rate vector satis- 
fying A + e(T) 6 A, where e(T) is a vector with min- 
imum entry e > 1/T. The average expected queue back- 
log limt^oo \ X)t=o Ec., E { ( 3i( T )} in the system is upper 
bounded by 

K NT 2 (fl^ 



NT 2 tfjV(T-l)(/w x +A mox ) 



and (x) + = max{x,0}. Under the assumption that 1/ry is 
an integer for every we have Af\t) = ru. 

Since we only allow there to be only a forwarder for any 
given packet at any time, if /3,^ (t) = 1, any nodes other than 
node m which have accumulated partial information of packet 
k must drop that partial packet k. This effect results in the last 
negative term in (fTTT i. On the other hand, the first negative term 
in ( fTTT i corresponds to successful decoding of packet k, after 
which it is removed and enters Q\ for some c. 



2(eT-l) 2 

The proof of Theorem [2] is provided in Appendix [B] The proof 
is based on the fact that the T-slot algorithm minimizes the 
T-slot Lyapunov drift, which is shown to be negative when 
D Cji <3i(*) is sufficiently large. 

The constructed algorithm proves the sufficiency of Theo- 
rem Q] The intuition behind the algorithm is that by using the 
weak links consecutively over a long window, the potentially 
contributing loss caused by dropped partial packets is kept 
small, therefore, the effective rates over the weak links can get 
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close to the link rate. The algorithm approaches the boundary 
of the capacity region in 0(1/T). 

When T is large enough, the average expected backlog in 
the system scales as 0(T). For this reason, in the next section, 
we introduce a virtual queue based algorithm which updates 
action every single slot. We expect that average backlog under 
the virtual queue based algorithm will be improved since its 
upper bound does not scale as 0(T). 

Given T > -, the upper bound in Theorem [2] is a convex 
function of T, This implies that for any A + e(T) e A, there 
exists an optimal value of T which stabilizes the system and 
introduces minimal delay bound. However, when the arrival 
rates are unknown, it may not be practical to search for this 
optimal value. 

Finally, we note that for some special values of T, the 
network can still be stabilized even when T < 1/e. For 
example, when T is chosen as II(ij) 7~> then, under any 
possible activation pattern s G S, all partial packets are 
decoded at the end of the T-slot window. This implies that 
the policy can stabilize any A + e 6 A. This phenomena 
will be illustrated through examples in Section I VIII For small 
networks, such a value for T can be easily computed and may 
be small; for large networks with many weak links, such value 
may still be quite large. 

V. Virtual Queue Based Algorithm 

In this section, we develop a second algorithm that exhausts 
the stability region without needing to set a large T. Thereby 
it attains better delay performance. As seen in Theorem Q] 
the delay is caused by the long time window of planning and 
infrequent update of the control action. Therefore, in order to 
obtain better delay performance, intuitively, we need to update 
our policy more frequently. This requires us to design more 
sophisticated mechanisms to handle the partial packet queues 
and additional analysis challenges brought by the temporally 
coupled decoding process over weak links. 

Our approach is to construct a network that contains "vir- 
tual" queues, which handle the partial packets and decoding 
process over weak links. The resulting network has the same 
maximum stability region as the original network. By stabi- 
lizing the constructed network, the original network is also 
stabilized. 

Specifically, in order to handle the partial packet queue in 
a simple and effective way, we introduce buffers over weak 
links. We assume there is a buffer at the transmitter side 
for each weak link. Then, if a node wants to send a packet 
over a weak link, the packet is pushed into the buffer. The 
buffer keeps the packet until it is successfully decoded at the 
corresponding receiver. The intuition behind the introduction 
of these buffers is that, since we don't want dropped partial 
packets to lose much effective rate over weak links, once the 
system picks a forwarding node for a particular packet, the 
system never changes this decision. 

For transmissions over weak links, a packet can only be 
decoded and transferred to next hop when enough information 
is collected. Under the T-slot algorithm, since control actions 
updates every T slot, and partial queues are cleared at the end 



of every epoch, it is relatively simple to track the queue evo- 
lution and perform the analysis. When control actions change 
every slot, queues may evolve in a more complicated way 
and thus difficult to track and analyze. In order to overcome 
the analysis challenges, we introduce a second buffer at the 
receiver side of each weak link. Under the proposed algorithm, 
we ensure that the receiver never tries to decode a packet until 
it accumulates enough information, i.e., queue length in the 
second buffer only decreases when it is sufficiently long. By 
doing this, the evolution of the queue lengths can be tracked 
and analyzed easily. 

Essentially, we only need to introduce virtual nodes and vir- 
tual buffers over weak links in order to handle partial packets. 
However, link rates may vary over time and vary in different 
activation patterns (discussed in Sec. IVI-Bb . Therefore, for the 
virtual queue based algorithm, we introduce virtual nodes and 
buffers over both weak links and strong links, and treat them 
uniformly. 

A. The Virtual Queue Vector 

We divide Qf (i) into two parts. The first stores the packets 
that have not yet been transmitted in any previous slots, de- 
noted as Uf(t). The second stores packets partially transmitted 
over some links but not yet decoded, denoted as Vf(t). Since 
each packet in the second part is associated with some link, in 
order to prevent any loss of effective rate caused by dropped 
partial packets, we require these packets to be transmitted over 
the same link until they are decoded. We use (t) to denote 
the information of packet k required to be transmitted over link 
and pjj (t) to denote the accumulated information of 
packet k at node j. We define V£(t) = Y,ke% ^fVX and 

Pij(t) = X^feeT ^ij (*)> wnere we recall that T c is the set 
of packets of commodity c. Note that P[j(t) is different from 
Pj(t) defined in Section [IV] since the latter is associated with 
node j and the former is associated with link 

Associated with virtual queues, we define virtual nodes, 
as depicted in Fig. Q] For the link from node i to node j, 
we associate one virtual node with {Vy} c and a second with 
{Pij}c- The virtual node associated with {Vy} c is denoted as 
Vij, while the virtual node associated with {Py} c is denoted as 
Pij. We have decomposed the weak link into three links: 
{i,Vij),( , ) , (p^ , j ) , with link rates 1 , , 1 , respectively. 
The virtual nodes and corresponding link rates for link (j, i) 
can be defined in a symmetric way. 

We follow the definition of control actions, where j (t) j 
represent the control action of the system at time t. Depending 
on whether or not packet k has already been transmitted by 
node i, we also divide the decision actions into two types, 

X(k) 2(k) 

denoted as p^ and p^ , respectively. In the following 
algorithm, we only make control decisions for the packets at 
the head of corresponding queues. Therefore we can replace 
the superscript packet index (k) by its commodity c without 
any worry of confusion. 

When 0lj{t) = 1, node i pushes a new packet from U?(t) 
into the tail of at the beginning of slot t. This implies 
that the system assigns node j to be the next forwarder for 
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Fig. 1. The constructed virtual system. 

that packet. Once the packet is pushed into V£, we transmit 
the packet that is at the head of Vfi to node j, generally a 
different packet. Thus, an amount of information can be 
accumulated at the tail of P[j(t), and the length of Vfi is 
reduced by r^ . This mechanism ensures that the packets in 
the virtual buffer is transmitted and decoded in a FIFO fashion. 

When 0fHt) = 1, without pushing a new packet into the 
buffer, we retransmit the packet at the head of V^(t), We let 

We require that 

$>S(t)<l, Vi,t. (14) 

Further, we define G {0, 1} as binary decoding 

control actions, ffjif) = 1 indicates that receiver j has 
accumulated enough information to decode the packet at the 
head of P[j(t). It then moves that packet out of P^if) and 
into Uj(t). We impose the following constraint 

/£(*)< P£(i), Vc,i,j (15) 

which indicates that ffj(t) = 1 only when Py(i) > 1, i.e., 
receiver j has accumulated enough information to decode the 
packet at the head of P?j(t). The reason for imposing this 
constraint on is that, we cannot simply use (Py(i) — 

fij(t)) + to represent the queue length of Py after a decoding 
action is taken at that queue. If P[j(t) < 1, even if = 1, 

after the decoding action, the queue length will still be (t) 
since no packet can be successfully decoded. This is not equal 
to (P?j(t) — fij(t)) + , which is zero in this scenario. 

Then, according to constraints ( TBi i and (TT3T >, the queue 
lengths evolve according to 

[f[(t + l) = (^^(t)) + +^0)+i, c (f) (16) 

j i 
V£(t + 1) < (Vg.(t) + ^F(t)(l - r y ) - $f(t)r<i) + 

= (vg-(*)+4 F (*)-Ai(*H-) + < 17 > 

P£(* + 1) < ^ (*) + fti(t)Tii - /§(*) (18) 

The inequalities in (T% and ( TT8l come from the fact that ^? (t) 
and j3f?(i) can be applied to a dummy packet when a queue 
is empty. When the corresponding queue is not empty, the 
inequality becomes an equality. 
Define 

L(U(t),V(t),P(t)) =]T(L/at)) 2 + £(^(*)) 2 

<M c,(ij) 

+ £TO*)-»/) 2 (19) 



where r\ is a parameter used to control the length of P?j(t). 
Define A(t) as the one-slot sample path Lyapunov drift: 

A(i) := L(U(i + l),V(f + l),P(i + 1)) - L(U(t),V(t),P(t)) 

Lemma 1 Under constraints M4\ and M5\ , the sample path 
Lyapunov drift satisfies 

A(t) < 2j2^(t)AUt)-2j2iU^t) - V^tM^t) 

c,i 

-2^[^(t)-P?.(t)]r^.(t) 

- 2 £[i*(f) - »» - (*) + «2 (20) 

a 2 = A'AT(<i + A max ) 2 +2N + KNd (21) 
The proof of this lemma is provided in Appendix [C] 

B. The Algorithm 

In contrast to the algorithm of Section |IV] this algorithm 
updates every timeslot. The purpose of the algorithm is to 
minimize the right hand side of (l20b given the current U, V, P. 

1) Find per-link backpressure. At the beginning of a 
timeslot, node i checks its neighbors and computes the 
differential backlogs. We compute the weight for the link 
between node i and the first virtual node, and, separately, 
the weight for the link between the two virtual nodes. 
Specially, the weight for control action 0\? is computed 
as 

Wlf(t) = [U?(t) - V%{t) + (V£(t) - P£(i))r y ]+ 
and the weight for control action 0f? is computed as 

W?f(t) = [V° j (t)-P° j (t)]+r ij 

The weight of commodity c over link is W?j(t) = 
max{Wlf(t),W^ c (t)}. The weight for the link is 

W ij (t) = maxW? : j (t), 

and the optimal commodity 

c*j = ajgmaxWy(i). 

2) Select forwarder. Choose the potential forwarder of the 
current slot for node i with the maximum weight Wij (t) 
and denote it as j* = argmaxj Wij{t). 

3) Choose activation pattern. Define the optimal activation 
pattern s* as the pattern s G S that maximizes 

i £ s 

4) Transmit packets. For each i e s*, if W^* > 0, let node 
i transmit a packet of commodity cjy. to node j*. For 
strong links, node i transmits a packet from the head of 
Uf . If link is a weak Unk, and W lj {t) = W^(t), 
node i pushes a new packet from U? into Vy. and 
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transmits the packet from the head of V?* (t); otherwise, 
node i resends the packet at the head of Vfi? (t). 
5) Decide on decoding actions. For each link (i, j) and each 
commodity c, we choose ffj(t) G {0, 1} to maximize 

[pzw-Tf-ufmzit) (22) 

where 77 is a parameter greater than or equal to 1 . We let 
/£(*) = 1 when If ^t) -r)-U?(t) = 0. 

Lemma 2 Under the above virtual queue based algorithm: 
(a) If P[j(t) < r\ for some weak link and slot t, then 

fij(t) = 0. (b) If P[j(to) > t] — 1, then, under the proposed 
algorithm, (t) > 77 — 1 for every t > to. 

Proof: In order to maximize (-Py(i) — T] — Uj(t))f^(t), 
= 1 only when P?.(t) - 77 - > 0. Therefore, 

if Pij(t) < i], fij(t) must equal zero, which proves (a). 

Now suppose that (t) > 77 — 1 for some slot t. We show 
that it also holds for t + 1. If P[j(t) > r], then it can decrease 
by at most one packet on a single slot, so that P^if + 1) > 
P^t) - /£(*) > V- 1. If < »7, we must have /£(t) = 

0, the queue cannot decrease in slot t, and we again have 
P£(i+1)>7?-1. U 

With Lemma [2] we can see that when setting 77 = 1, under 
the proposed algorithm, if P£ (t) < 1 for some weak link (i, j) 
and slot t, then ffj(t) = 0. /f ? - (t) can only equal one when 
Pij{t) ^ 1- Thus, constraint (fT5l l is satisfied automatically for 
every slot under the proposed algorithm. 

Theorem 3 For a network with given link rates {rij} and a 
feasible activation pattern set S, the network capacity region 
A' for the constructed network consists of all rate matrices 
{A£} for which there exists flow variables {^J},v — 1,2,3 
together with probabilities tt s for all possible activation pat- 
tern s G S such that 

< c >0, » v c t = 0, ^f = Q, Vi,j,v,c (23) 
t j 

I'f; < I'n Vi.j.c (25) 

E^^EE^- Vi ^' (26) 

c c sGS 

Y, t- < 1 (27) 

where the probabilities 0^{s) satisfies 

e c lJ { S )=Qifi{s, 53^.(«) = l,Vi (28) 

Proof: The necessary part can be proved in the same way for 
Theorem [TJ The sufficiency will be proved through construct- 
ing an algorithm that stabilizes all rate vectors satisfying the 
constraints. ■ 

In this constructed virtual network, fj,jj,p,fj,fj,fj can be 
interpreted as the flow over links (i,Vij), (vij,Pij), (pij,j), 
respectively. The constraints (l23l represent non-negativity and 



flow efficiency constraints. The constraints in (l24l i. d25l l repre- 
sent flow conservation constraints, where the exogenous arrival 
flow rates for nodes are zero. The constraints in ( |2oT > 

represent the physical link constraint for virtual link (vij,Pij), 
which equals the link constraint for the real link in the 
original system. Note that there is no explicit link constraints 
for (i, Vij ) and (pij , j), since the transfer of packets over these 
links happen at the same node, and there is no physical link 
constraint on them. 

Lemma 3 The network capacity region for the virtual network 
A' defined in Theorem\3\is equal to that for the original system 
A defined in Theorem UJ 

Proof: First, we show that if A G A', then it must lie in A as 
well. This can be shown directly by letting fi^ = fijj, Thus, 
we have fijj < fi^ < Plugging into ( |24> . we have (01, 
i.e., if A satisfies the constraints in (f23t-(l28l>, it must satisfy 
©-(01 as well. Thus A G A. 

The other direction can be shown in the following way: we 
prove that for any A + e G A, A + 2 d+i e ^' ' wnere d is the 
maximum degree of the network. 

Since A + e G A, we have 

5>H +A <+ e ^ZH-' Vi ^ c - (29) 

I j 

By letting ^ = \x\-, we have that (|26]l-(|28]i satisfied. At the 
same time, we let p,}? + e\ = — ei = /ifj, and plug them 
into ( |29l , which gives 

^( /i 3c_ ei ) +A c + e <^( /i lc + £l)) Vl ^C. (30) 

I 3 

Therefore, 

Y^ + K + e-2de x <Y J ^h Vz^c. (31) 
1 3 
By letting e — 2de\ = e\, we have 

5>^ + A? + ei <5>^ Vi^c (32) 
l j 

Mtf+ei=Mtf» l4j+tl = l4j- (33) 

Thus, we have A + ei G A'. As e — > 0, ei approaches zero as 
well. Thus, A = A'. ■ 

Theorem 4 For 77 > 1, the proposed algorithm stabilizes any 
rate vector satisfying A + e G A. The average expected queue 
backlog in the system is upper bounded by 

(2d + l){KN{d + A max f + 2N+ (277 + l)KNd) 
e 

The proof of the theorem is given in Appendix ID1 Since the 
upper bound is monotonically increasing in 77, we can always 
set 77 to 1 to achieve a better delay performance. 

The algorithm updates every slot. This avoids the delay 
caused by infrequent policy updating in the T-slot algorithm. 
On the other hand, we introduce virtual queues in the system. 
Since the differential backlog in Step 1) of the algorithm is not 
the physical differential backlog between nodes in the 
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real system, the inaccuracy of the queue length information 
can, potentially, increase the average backlog in the system. 
This is reflected by the 2d+l factor in the upper bound. But for 
arrival rate vectors that are close to the boundary of network 
capacity region, the T-slot algorithm can only stabilize the 
system if T is large. In such situation thus the virtual-queue 
based algorithm attains a better delay performance. 

The algorithm exhausts the maximum stability region of 
the network without any pre-specified parameter depending 
on traffic statistics. Compared to the T-slot algorithm where 
the stabilizing parameter T actually depends on how close a 
rate vector A gets to the boundary of the network stability 
region A, this is a big advantage. 

VI. Discussions 

A. Enhanced Virtual Queue Based Algorithm 

According to Theorem [2] the average expected backlog in 
the system under T-slot algorithm is 0(T 2 ), which indicates 
poor delay performance when T is large. The virtual queue 
based algorithm avoids the long delay caused by the infre- 
quent updating of queue length information. However, because 
of the introduction of virtual nodes, packets accumulate in 
virtual queues over weak links, which negatively impacts 
delay performance, especially when the system is lightly 
loaded. In order to improve delay performance, we proposed 
to enhance the virtual queue based algorithm by adjusting the 
term associated with Vij in the Lyapunov function. 

Define a modified Lyaponov function 

L(U(t), V(t),P(t)) = E(^W) 2 + E (W + 7/r <3 -) 2 
<M c,(»,j") 

+ Era*)-'?) 2 - (34) 

c,(i,j) 

Compared with ([T9T l, we have added a term 7/V.y to each 
Vy (i) in (f34-b . This is equivalent to add queue backlog 7/r.y in 
the virtual queue. Following an analysis similar to that of the 
virtual queue algorithm (cf. Appendix [Cj, we can show that 
under constraints (fl4l i and $15[ , the sample path Lyapunov 
drift satisfies 

A(t)<2^[/^ W -2^[^( i ) + 7 /r y -^(^( i ) 

c,i c y i,j 

-2Y^m j (t)-r 1 )-U°(t)}f? j (t) + a 3 

where 0:3 is a positive constant. In order to minimize the one- 
step Lyapunov drift, we substitute the following "modified" 
step Ml) for step 1) of the virtual queue based algorithm as 
follows: 

Ml) Find per-link backpressure. At the beginning of a 
timeslot, node i checks its neighbors and computes the 
differential backlogs. We compute the weight for the link 
between node i and the first virtual node, and, separately, 
the weight for the link between the two virtual nodes. 



Specially, the weight for control action /3,^ c is computed 

as 

W^{t)=[Ut{t)-{V l c j {t)+ 1 /r ij ) 

+ {V? j {t) + 1 /r ij -Pi j {t))r ij ] + , 

and the weight for control action f3fj is computed as 

Wg(t) = [Vg(t) + 7 /r« - P?M +r «- 

The weight of commodity c over link is W^(t) = 
max{Wlf(t), Wffit)}. The weight for link is 

W ij (t) = maxW? j (t), 

and the optimal commodity 

c*j = argmaxWy(t). 

The rest of the steps remain the same as in Sec. IV-BI 
Following similar steps as in the proof of Theorem 4, we 
can show that the enhanced version also achieves the maxi- 
mum stability region. The intuition is, in heavily backlogged 
regime V^{t) 7/Vy, therefore the added queue backlog is 
negligible and doesn't impact the stability of the queue. 



As in the virtual queue based algorithm, in the enhanced 
algorithm, we also set 77 = 1 to guarantee that = 1 

only when Pfj{t) > 1. We now discuss the effect of the 7 
parameter. We set 7 = 1/2. By setting 7 to be some positive 
value, the system adds some virtual backlog to buffer Vij, thus 
preventing packets from entering the empty buffers over the 
weak links when the system starts from an initial empty state. 
It also increases backpressure between Vij and Py . Therefore, 
packets tend to be pushed through links more quickly, and 
the decoding time is shortened accordingly. Besides, in the 
modified Lyaponov function, we select weights for the virtual 
backlogs of virtual queues as the inverse of link rates. The 
reason for such selection is that the number of slots required 
to deliver a packet through a link is equal to the inverse of 
the link rate. We aim to capture different delay effects over 
different links through this adjustment. The intuition behind 
the enhanced algorithm is that, when the system is lightly 
loaded, passing packets only through strong links can support 
the traffic load while still providing good delay performance. 
Therefore, using weak links is not necessary, and using strong 
links is preferable. Setting the virtual backlog length to be 
7/Vjj forces packets to select strong links and improve the 
delay performance of the virtual queue based algorithm in the 
low traffic regime. When the system is heavily loaded and 
strong links cannot support all traffic flows, the differential 
backlogs over certain strong links eventually decrease, and 
weak links start to be used. The enhanced algorithm essentially 
is a hybrid of the classic backpressure algorithm (T = 1) 
and virtual queue based algorithm. It always stabilizes the 
system, and automatically adjusts the portion of slots that the 
system operates under each algorithm to maintain a good delay 
performance. 
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B. Dependent Link Rates 

For the simplicity of analysis, in previous sections we 
assume that the link capacity for a given transmitter-receiver 
pair is fixed. We can easily relax this assumption and gen- 
eralize the system model by assuming that the link rates are 
not fixed but are rather a function of the chosen activation 
pattern. Specifically, to be better able to capture the effects 
of interference, for link pair we assume the link rate 

under activation pattern s is ry-(s). Then, the network capacity 
region under the new assumption is characterized by the same 
inequalities in Theorem Q] except that in eqn. (O rij(s) is used 
instead of ry. 

The scheduling algorithms should be adjusted accordingly 
in order to achieve the network capacity region. E.g., for the 
T-slot algorithm, in each updating slot, after determining the 
maximizing commodity c*j for each link the system 

selects the activation pattern, as well as the corresponding 
forwarder for each active transmitter, to maximize 

max [Qf(t)-Qf(t)] + r l3 (s). 

The remaining steps remain the same. For the virtual queue 
based algorithm, the maximizing commodity should be jointly 
selected with the activation pattern and the corresponding 
forwarders at the same time. 



C. Distributed Implementation 

The T-slot algorithm and the virtual queue based algo- 
rithm presented in the previous sections involving solving a 
constrained optimization problem (max-weight matching) in 
a centralized fashion. Here we consider distributed imple- 
mentations. We assume nodes have information of the link 
rates between themselves and their neighbors, and the queue 
backlogs of their neighbors. 

In interference-free networks where nodes can transmit 
simultaneously without interfering with each other, minimizing 
the Lyaponov drift function can be decomposed into local 
optimization problems. Each node individually makes the 
transmission decision based only on locally available infor- 
mation. 

In networks with non-trivial interference constraints, we 
can use the message passing algorithm proposed in |fl9l to 
select the activation pattern in a distributed way. First, each 
node selects its forwarder based on its local information. 
Then, the system uses the belief propagation based message- 
passing algorithm to select the optimal activation pattern that 
minimizes the Lyaponov drift. If the underlying interference 
graph is bipartite, lfl9l shows that the the message-passing 
algorithm always converges to the optimal solution. Another 
possible approach is to develop carrier sensing based random- 
ized scheduling algorithms. Carrier sensing based distributed 
algorithms are discussed in |20l . ETI . etc. The throughput 
optimality of these algorithms are established under certain 
assumptions. Some other distributed implementations of back- 
pressure algorithms are discussed in Chapter 4.8 of ifTTl . 




Fig. 2. The 4-node network used to compare the T-slot algorithm and the 
virtual queue based algorithm. The number labeling each link is the rate of 
that link. 
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Fig. 3. The maximum stability region of the 4-node network. The inside 
triangular region is the network capacity region when MIA is not exploited. 

VII. Simulation Results 

In this section we present detailed simulation results. These 
results illustrate the basic properties of the algorithms. 

A. A single-commodity scenario 

First consider the 4-node wireless network shown in Fig. [2] 
where the links with nonzero rates are shown. We consider 
a single commodity scenario where new packets destined for 
node 4 arrive at node 1 and node 2 according to independent 
Bernoulli processes with rate Xi and A2, respectively. Node 
3 does not have incoming packets. It acts purely as a relay. 
We assume that the system does not have any activation 
constraints, i.e., all nodes can transmit simultaneously without 
causing each other interference. 

The maximum stability region A, shown in Fig. [3] is the 
union of rate pairs (Ax, A2) defined according to Theorem Q] 
If mutual information accumulation is not allowed, the corre- 
sponding maximum stability region is indicated by the dashed 
triangle inside A. This follows because when weak links are 
not utilized, the only route from node 1 to node 4 is through 
node 2, thus the sum of arrival rates from node 1 and 2 cannot 
exceed link rate 1 . When mutual information accumulation is 
exploited, the weak link from node 1 to node 4 with rate 1/9 
can be utilized, expanding the stability region. 

We first compare the performance of the T-slot algorithm 
for different values of T. For each T, we conduct the simu- 
lation for arrival rates Ai = A2 = A ranging from to 0.55. 
The resulting average backlog curve is shown in Fig. [4] When 
T = 1, the weak links cannot be utilized, and the algorithm 
can only stabilize the arrival rates up to A = 1/2, which 
is point A in Fig. [3] When T = 9, the reciprocal of the 
link rate 1/9, the algorithm can stabilize arrival rates up to 
A = 9/17, corresponding to point B in Fig. [3] In this case, 
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Fig. 4. Comparison of average backlog in the system under the algorithms. 

all of the partial packets transferred over weak link (1, 4) are 
eventually decoded, and that weak link is fully utilized. This 
is a special scenario since the value of T perfectly matches 
the rate of weak link. For larger networks that consist of many 
weak links, selecting T to match all of the weak links may 
be prohibitive, since such value can be very large. Except for 
such perfect matching scenarios, for more general values of 
T, the weak link is partially utilized, therefore, the maximum 
A the algorithm can stabilizes is some value between 1/2 and 
9/17. In general, a larger T stabilizes larger arrival rates, and 
results in an increased average backlog in the system. This is 
illustrated by curves with T = 15,60 in Fig.H 

Fig. S also plots the performance of the virtual queue based 
algorithm. The system can be stabilized up to A = 9/17 under 
the virtual queue based algorithm. Compared with the T-slot 
algorithm, the virtual queue based algorithm attains a much 
better delay performance for large value of A, i.e., when the 
system is in heavy traffic regime. It dominates even the curve 
with T = 9 at high rates. For small values of A, the virtual 
queue based algorithm has a worse performance in terms of 
delay. This is because the algorithm requires the virtual queues 
to build up certain lengths in order to push packets through the 
weak links. The virtual queue based algorithm has relatively 
constant delay performance for A G [0, 1/2], while under the 
T-slot algorithm, the average backlog increases monotonically 
with A. 

B. A multi-commodity scenario 

Next we consider the 10-node network shown in Fig. [5] 
We consider a multi-commodity scenario in which packets 
destined for node 10 arrive at node 1 and packets destined 
for node 9 arrive at node 2. Arrivals are distributed according 
to two independent Bernoulli processes with rate Ai and A2, 
respectively. We assume that the system does not have any 
activation constraints so that all nodes can transmit simulta- 
neously without causing interference. 

The maximum stability region A is shown in Fig. [6] If 
mutual information accumulation is not allowed, the corre- 
sponding maximum stability region is the dashed triangle 
inside A. This follows because when weak links are not used, 
the routes from node 1 to node 10 and the routes from node 2 
to node 9 must pass through link (4, 7), thus the sum of arrival 




Fig. 5. The 10-node network used to compare the T-slot algorithm and the 
virtual queue based algorithm. The number labeling each link is the rate of 
that link. 
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Fig. 6. The maximum stability region of the 10-node network, where B = 
(11/15, 11/15). The dashed triangular region is the network capacity region 
when MIA is not exploited. 



rates from node 1 and 2 cannot exceed that link rate. When 
mutual information accumulation is exploited, weak links can 
be utilized and form additional routes 1— >5— s-6— > 10, 
2 — > 3 — > 8 — 5- 9, thus an expanded stability region can be 
achieved. 

We first compare the performance of the T-slot algorithm 
for different values of T. For each T, we conduct the sim- 
ulation for arrival rates Ai = A2 = A ranging from to 
0.75. The resulting average backlog curve is shown in Fig. [7] 
When T = 1, the weak links are not utilized, so the algorithm 
can only stabilizes the arrival rates up to A = 1/2, which 
is point A in Fig. [6] When T = 30, which is the reciprocal 
of the link rate product and perfectly matches the rates 
of weak links, the algorithm can stabilize the arrival rates 
up to A = 11/15, corresponding to point B in Fig. [6] For 
more general values of T, the weak links are partially utilized, 
therefore, the maximum A the algorithm can stabilizes is some 
value between 1/2 and 11/15. In general, a larger T stabilizes 
larger arrival rates, and results in an increased average backlog 
in the system. This is illustrated by curves with T = 5, 17 in 
Fig. [7] In order to achieve the boundary point, a large T is 
required in general. 

In Fig. [6j we also present performance results from sim- 
ulation of the virtual queue based algorithm. As expected, 
the system can be stabilized up to the edge of the stability 
region, A = 11/15. Compared with the T-slot algorithms, 
the virtual queue based algorithm attains a much better delay 
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Fig. 7. Comparison of average backlog in the system under the algorithms. 

performance in heavy traffic regime. It dominates the curve 
with T = 30 over the displayed rate region. Similar to the 
single-commodity scenario, for small values of A, the virtual 
queue based algorithm does not show much advantage in 
terms of delay. Finally, we also provide simulation results 
for the enhanced virtual queue based algorithm in Fig. [7] 
The enhanced algorithm stabilize the full sepctrum of arrival 
rate, i.e., every input rate vector up to A = 11/15. The 
delay performance in the light traffic regime (A < 1/2) is 
improved under the enhanced version, with a smaller penalty 
of delay in the heavy traffic regime. The delay performance 
transition around A < 1/2 can be explained by the hybrid 
of and automatic adjustment between the classic backpressure 
(T = 1) and virtual queue based algorithm. In the simulation, 
we set 7 = 1/2. The value of 7 decides the tradeoff between 
the delay performance in the light traffic regime and the delay 
performance in the heavy traffic regime. 

VIII. Conclusions 

In this paper, we analyzed the optimal routing and schedul- 
ing policy when mutual information accumulation is exploited 
at the physical layer in a wireless network. We first char- 
acterized the maximum stability region under the natural 
assumption of policy space, which is shown to surpass the 
network capacity region when mutual information accumu- 
lation is not allowed. Two scheduling policies are proposed 
to cope with the decoding process introduced by mutual 
information accumulation. The T-slot algorithm and the virtual 
queue based algorithm can both achieve the maximum stability 
region but the latter has significantly reduced delay in heavy 
traffic regime. We also compared the performance under these 
two policies analytically and numerically. 

Appendix A 
Proof of the Necessity Part of Theorem[T] 

Suppose that a stabilizing control strategy exists. It is 
possible that the strategy may use redundant packet transfers, 
i.e., allowing multiple copies of a packet to exist at the same 
time in the network. However, under assumption A3, each 
packet can only have a single parent, i.e., once a node starts 
to accumulate information for a packet, it can only decode 



the packet if the total received information from a single 
transmitter exceeds the amount of information contained in 
that packet. 

Define Xf(t) as the total number of packets of commodity 
c that have arrived at node i up to slot t. Define T> c (t) as 
the set of distinct packets of commodity c delivered to the 
destination node c over [0, i), and D c (t) = \V c (i)\ be the 
total number of such distinct packets. Then, we have D c (t) = 
E"=i where is defined in Section|m] If multiple 

copies of a packet depart from the system, we only count the 
first delivered copy. In a stable system, for any traffic flow of 
commodity c entering source node i, the average number of 
distinct packets delivered to destination node c in each slot 
must be equal to its arrival rate, thus, we have 



lim 



lim 

t— >oo 



(35) 



For each distinct delivered packet k, there is a single routing 
path this packet took from its origin node to its destination 
node. Denote Dfj (t) as the total number of distinct packets in 
T> c (t) transmitted from node i to node j over [0,i). We have 



£ D c u {t)+Ym 



(36) 



Define T s (t) to be the number of slots within which the 
system operates with activation pattern s £ S up to time t, 
and let T^{s, t) be the number of slots that link (i, j) is active 
and transmitting a packet in V c (t) under activation pattern s 
up to time t. Therefore, we have 



Dm 



Tti(a,t)r i: 



Thus, 



Define nlAt) 



t 

D-At) 



£ — =rr^n 



ses 



T s (t) 



(37) 



(38) 



7T a (t) 



T s {t) 
t 



and ^.( S ,() = |J. 
We note that since we can deliver at most one packet to i from 
j in each time slot, 



</&(*)<!, /4W=0, Mc,i(*) = 0. 



(39) 



Because only one activation pattern is allowed per slot, we 
have 



(40) 



On the other hand, since a node can only transmit a single 
packet in any slot, and we only count distinct copies of 
packets, then, if node i is transmitting a packet of commodity 
c at time t, at most one of the received copies at its neighbors 
can be counted as a distinct packet in T> c {t). Thus, we have 



0?(s,i)<l ifie*. 



(41) 



We can always make inequality fiTt tight by restricting to the 
policy space that a node can only transmit when it is necessary, 
i.e., if node i's transmission at time t does not contribute to 
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the delivery of any distinct packet to its destination, it should 
keep silent and be removed from the activation pattern at time 
t. The remaining active nodes form another valid activation 
pattern, and gives T s (t) = J2c j ^ij( s >0 f° r everv i £ s - 

These constraints define a closed and bounded region with 
finite dimension, thus there must exist an infinite subsequence 
t over which the individual terms converges to points ir s 
and Oji(s) that also satisfy the inequalities (f39b-d4TT>: 



Jim = fj^j, 

t— >oo 

lim 7T s (t) = 7T S , 
t— >oo 

Jim 0Us 7 t) = eUs)- 



(42) 
(43) 
(44) 



Furthermore, using 

E^ 

t 

This proves the result. 



in d36t and taking t —> oo yields 
•V E'V Vl ^ c - < 45 > 



Appendix B 
Proof of Theorem[2] 

Our algorithm always transmit packets from the head of 
Qi(t), and there is at most one full copy of each packet in 
the network. Therefore, without worry of confusion, in the 
following analysis we drop packet index k and use commodity 
index c instead as the superscript of control actions {/3^} and 

First, define the Lyapunov function 

and the T-slot sample path Lyapunov drift as 

A T (t) := L(Q(t + T))-L(Q(t)). 
Then, we have the following Lemma. 

Lemma 4 Assume the system changes its policy every T slots 
starting at t = 0. For all t = 0, T, 2T, . . . and all possible 
values o/Q(t), under a given policy {0fj{t)}, we have 



A T (t) < J2 -2Q?(t)(T -Tj^m) 



T-l 



r=0 



where 



ai = KNT 2 {ii max + A max ) 2 + NT 2 



m 



(46) 



(47) 



Proof: Under any given policy, the queue length evolves 
according to 

Q?(* + i) = («?(*) -£/*&(*)/§(*)) + 

3 

+ £#i(f)/j C i(f)+4 5 (*)- ( 4 8) 



Considering the policy which updates at t = 0, T, 2T, . . ., we 
have 



Qi(t + T) 



T-l 



< 



(w)-EEW)^(* +t )) + 

T-l T-l 

+ £ £ 0£(*)/m(* + ^) + E ^ c (* + t ) < 49 > 

T = Z 

< fe*)-£/^Hn/n 



T = 

+ 



T-l 

+ E ft® L r <» T J + E ^ 5 (* + r )- ( 50 > 

In (|49l >. we upper bound QJ(i + T) by moving the negative 
terms into the function (•)+. This follows from the facts that 

max[a + b — c, 0] < max[a — c, 0] + b 
max[max[a, 0] — c, 0] = max[a — c, 0] for a,b,c > 0. 

This is equivalent to letting node i transmit packets existing in 
Q1(t) only, i.e., even if there are some packets of commodity 
c arrive at node i in the epoch and all of the packets existing 
in Q1(t) have been cleared, they are not transmitted until 
next epoch. Since under the policy, these packets may be 
transmitted to next hop, the actual queue length Qf(t+T) can 
be upper bounded. Eqn. d50t follows from the fact that over the 
T-slot window, the successfully delivered packets (including 
dummy packets) from node i to node j is [Tj3fj(t)rij\, and 
recall f3fj (t) is held constant for the whole epoch. Since both 
sides of the inequality are positive, it holds for the square of 
both sides, thus, 

(Q-(t + T)) 2 

< (q?(*)-E^)l^ t j) 2 

3 

T-l 

+ 2Qm ( £ LniTj + E A i(t + t)) 

T-l 2 

+ (E^«(*)NTj + E^ c (*+ r )) ( 51 > 



T = 



<(Q|(i)) 2 -2Qf(t)(^^.(t)Lr^Tj 

3 

T-l 

- E A C i(*) LHiTj - E + rj) + C?, (52) 



T = 



where 



T-l 



= (£/*£(*) Lr«rj + £ J 4?(*+r; 

'5>&(t)LryTj 



T = 

2 



We use Q?(t) instead of (Q?(f) - £\ /9&(t) IryTj) for the 
cross term in (1511 . Since the former is always greater than the 
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latter, the inequality ((5TJ holds. Therefore, we have Proof: The result is an immediate consequence of TheoremQ] 

fsr^ ^ ne mm ition i s to think E{/3 C }rjj as fj,^ in ©-(O. The 

necessary part is obtained directly. The sufficient part will be 



c,z 
T-l 



shown in the following section. 



-J^AKt + r^+J^Cf (53) 

t— c,i 

< £ -2Qf(t)(^^.(t)(ryT - 1) - ]T 0S(t)r K T 

c,i j Z 

T-l 

-^^(t + rjW^C? (54) 



r— c,i 

<X;- 2 0i(*)( r E^(*) r «- T EAi(*)n < 



C,2 

T-l 



For any {A + e} G A, Lemma [5] shows that the T- 
slot algorithm minimizes the right hand side of ( |57] i for 
A c (t _|_ T ) _ {\ _|_ (j c (55) any alternative policy satisfying ((H)-®- On the other hand, 

r=0 ' i Corollary Q] implies that such policy can be constructed in a 

randomized way which is independent of current queue status 

where AtU) is the T-slot Lyapunov drift, (|541 > follows from ... . , , ,. e ■ r^si ^ u- • r m a 

1 w 3 F . . N — ' in the network and satisfying ( 1581 1. Combining Lemma |5J and 

the fact that x - 1 < \x\ < x, and (|55J follows from the fact Coro j lar y [JJ we have 

that < 1- Based on the assumptions that Af(t) < 

A m ax, the maximum number of decoded packets at a node is E{At(£)|Q(£)} 

upper bounded by MmQX , and the constraint ®, we have < ^ -2Q?(t) ((j8£ (t)r« - ]T - A?)t - l) + a x 

J2 C "< KNT 2 (ti max + A max f + NT 2 := a x ^ « 

<2-2Qi(*)(eT-l) + ai 
The proof is completed. ■ M 

Taking expectations of the above inequality over the distribu- 
Lemma 5 For a given Q(t) on slot t = 0,T, 2T, . . ., under tion of Q(*)> we have 

f/ie T-slot algorithm, the T-slot Lyapunov drift satisfies E{Ay(i)} < — 2E{Q c (t)} (eT — 1) + ot\ (59) 



A T (t) <E- 2 ^ c w( T EA^(iK- -^E^w 



Summing terms over i = 0,T, . . . , (M — 1)T for positive 



T-i integer M yields 



r=0 



5^(t + r)-l) +a x , (56) 



E{L(Q(A/T))-L(Q(0))} 



M 

w/iere are any alternative (possibly randomized) ^, M ~o X) ' — 2E{Q c (toT)} (eT — 1) 
decisions that satisfy (HI"©. Further more, we have < 1 h ci'i, 



{A T (t)|Q(t)} < E{^-2g i c (t)(r^^.(t)r y i.e., 

-T2^(t)r„-TA|-l) Q(t)}+ ai (57) ^ ^ E{Q 4 (mT)} < - 



L(Q(0))+M Ql 
" 2(eT~l) ' (60) 

Proof: Given Q(t), the T-slot algorithm makes decisions to 

minimize the right hand side of (g6]l. Therefore, inequality We drop L(Q(MT)) in d60j since L(Q(A/T)) > based on 

(l56l l holds for all realizations of the random quantities, and the definition of Lyaponov function. On the other hand, for 

hence also holds when taking conditional expectations of both t = 0, T, 2T, . . . and < t < T, we have 
sides. Thus, we have (IBTTi. ■ _ c/ > awja , / 4 \ /-tin 

„ , . . r \ , ~i • • ./ - a where ^4 ma x is the maximum arrival rates and u ma ,x is the 

Corollary 1 A rate matrix { A + e} is in the capacity region A /-max 

w . . . . , ., , . • ji maximum number of decoded packets in a slot for any node. 

;/ and only if there exists a stationary (possibly randomized) ^ f 

algorithm that chooses control decisions subject to constraint 

(|S|-((9| and independent of current queue backlog to yield A Sz} , ^l,^, 1 * , 

r _ ' \ £££E{Q^mT + T)}<££(T-l)E{Q?(mT)} 

E { E Pv r a ~ E ^ ru ~ A * } - 6 v * ^ c - (58) m=0 T=1 m=0 c ' i 

j 1 +MT(T-l)KN(n max + A max )/2 
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Combining with d60l ), we have 
1 M-l T-l 



m— r— c,i 
M-l 



^ jj E E E {<9i( mT )} + KN(T - l)(/w + A ma x)/2 



m— c.i 



~ 2M(eT- 1) 2 
Letting A/ — > oo, we have 



< 



r— c,i 

ai , ^(T-l)(/i TOOa: + A, 



2(eT-l) 2 

Therefore, when T > -, the average expected backlog in 
the network is bounded. Thus, the system is stable. Since our 
algorithm operates under assumptions A1-A2, the total number 
of partial packets in the system, ^2 ci Pi(t), is always upper 
bounded by ^2 c iQi(t). Therefore, if the {Qf(t}} are stable, 
the {P[(t)} must be stable, and the overall system is stable. 



Appendix C 
Proof of LemmaQ] 

Based on (fT6l)-(fT8l). we have 

(^+i)f<(^)) J + (E^) 

-2^(*)(E4 c (*)-E/«w-^(*) 

3 I 

+ (£/S(*) + ^ c W 



(62) 



(y$(t + 1)) 2 < (^(t)) 2 + 2V£(t) (p%{t) - ff 3 (t) nj ) 



4 c ( 

(i*(t + l)) 2 < {P^{t)f + 2P^{tm j {t)r ij (t) - /f.(t)) 



(63) 



p?At+i)>pm-fm 



(64) 
(65) 



Thus, 



aw < £-2^(*)(E4- w -£/«(*) -^ c (*), 

+ ^2^.(t) (/?£(*)-#,•(*>•«) 

c,i,j 



where 



c = E(E4 c w) +£(£/«(*) +4 c w 



+ E [ - W) r «) a + C8« (*>«(*) - /«(*))' 



Because of constraints (fT4l-([T5]l, we have 

C < KN(d + A max ) 2 + 2N + KNd :— a 2 - (66) 
Combining items with respect to link we have ( |20l >. 

Appendix D 
Proof of TheoremO 

Corollary 2 A rafe vector X + e is in the capacity region A' if 
and only if there exists a stationary (possibly randomized) al- 
gorithm that chooses control decisions (independent of current 
queue backlog) subject to constraints ( 1741 ), to yield 



E{^K-=E{4 c } + e 
E{/P.}=E{/3P,}r ij+ e 



E 



{E/£-EAi c - A i}^ £ v ^ c 



(67) 
(68) 

(69) 



Proof: The result is an immediate consequence of Theorem [3] 
The intuition is to let V.{ ^. }'',, = /if/- E {/3^ c } = /4? and 
E{/P} = /i??. The necessary part is obtained directly. The 
sufficient part will be shown in the following proof. ■ 

Lemma 6 Under the virtual queue based algorithm, 
+ [i*(t)-»?-tf;(t)]/&(t)} 



(70) 



/or any ofner binary control policy {$}j,$ij, fij) satisfying 



Proof: This lemma is an immediate consequence of the fact 
that the virtual queue based algorithm maximizes the left 
hand side of (f70l i while satisfying ( TT~4-b . The constraint (fT~5T > 
is satisfied automatically. ■ 

Based on Lemma [T| (|20T> and Lemma [6] we have 

E{A(t)|U(f),V(f),P(f)} 



< 



< E 



-[J*(t) rj UW)]f&t)\ U(t),T(i),P(i)} 
2E^(*)Af + a 2 

{ £ -2um ( £ - E &(*) - A i(*)) } 

{£2Vg(t) (^/(t)-j8?-(tK-)} 
{ £ 2i^(t)C8P-(<)r«(t) - /£•(*)) + 2t ? /P-(*)} + a 2 



E 



E 
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Since for A + e e A, we have A + e/(2d + 1) G A', thus 
E{A(t)|U(f),V(f),P(f)} 

< -2 ( £ um + y: v§ (t) + s pt, w) ^ 

+ 2?7A^ + i<rjV(d + A max ) 2 + 2/V + ifAf(J. 

Therefore, the system is stable, and the average backlog is 
upper bounded by 

& i E ( E ^) + E ^ w + E ^ w) 

r— e,z c,i,j c,i,j 

(2d + l){KN(d + A max ) 2 + 27V + {2r, + l)KNd) 
~ e 
The proof is completed. 
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